Tree Edit Distance Cannot be Computed in Strongly Subcubic Time (unless APSP can)
نویسندگان
چکیده
The edit distance between two rooted ordered trees with n nodes labeled from an alphabet Σ is the minimum cost of transforming one tree into the other by a sequence of elementary operations consisting of deleting and relabeling existing nodes, as well as inserting new nodes. Tree edit distance is a well known generalization of string edit distance. The fastest known algorithm for tree edit distance runs in cubic O(n) time and is based on a similar dynamic programming solution as string edit distance. In this paper we show that a truly subcubic O(n3−ε) time algorithm for tree edit distance is unlikely: For |Σ| = Ω(n), a truly subcubic algorithm for tree edit distance implies a truly subcubic algorithm for the all pairs shortest paths problem. For |Σ| = O(1), a truly subcubic algorithm for tree edit distance implies an O(nk−ε) algorithm for finding a maximum weight k-clique. Thus, while in terms of upper bounds string edit distance and tree edit distance are highly related, in terms of lower bounds string edit distance exhibits the hardness of the strong exponential time hypothesis [Backurs, Indyk STOC’15] whereas tree edit distance exhibits the hardness of all pairs shortest paths. Our result provides a matching conditional lower bound for one of the last remaining classic dynamic programming problems. The collaboration on this paper was established during Dagstuhl seminar 16451. ∗Max Planck Institute for Informatics, Saarland Informatics Campus †University of Haifa. Partially supported by the Israel Science Foundation grants 794/13 and 592/17. ‡IDC Herzliya. Partially supported by the Israel Science Foundation grants 794/13 and 592/17.
منابع مشابه
Subcubic Equivalences Between APSP, Co-Diameter, and Other Complementary Problems∗†
Despite persistent effort, there is no known technique for obtaining super-linear lower bounds for the computational complexity of the problems in P. Vassilevska Williams and Williams [38] introduce a fruitful approach to advance a better understanding of the computational complexity of the problems in P. In particular, they consider All Pairs Shortest Paths (APSP) and other fundamental problem...
متن کاملSubcubic Equivalences Between Graph Centrality Problems, APSP and Diameter
Measuring the importance of a node in a network is a major goal in the analysis of social networks, biological systems, transportation networks etc. Different centrality measures have been proposed to capture the notion of node importance. For example, the center of a graph is a node that minimizes the maximum distance to any other node (the latter distance is the radius of the graph). The medi...
متن کاملFast Computation of the Tree Edit Distance between Unordered Trees Using IP Solvers
We propose a new method for computing the tree edit distance between two unordered trees by problem encoding. Our method transforms an instance of the computation into an instance of some IP problems and solves it by an efficient IP solver. The tree edit distance is defined as the minimum cost of a sequence of edit operations (either substitution, deletion, or insertion) to transform a tree int...
متن کاملThe Intractability of Computing the Hamming Distance
Given a string x and a language L, the Hamming distance of x to L is the minimum Hamming distance of x to any string in L. The edit distance of a string to a language is analogously defined. First, we prove that there is a language in AC such that both Hamming and edit distance to this language are hard to approximate; they cannot be approximated with factor O(n 1 3 − ), for any > 0, unless P =...
متن کاملAn Improved Algorithm for Tree Edit Distance Incorporating Structural Linearity
An ordered labeled tree is a tree in which the nodes are labeled and the left-to-right order among siblings is significant. The edit distance between two ordered labeled trees is the minimum cost of transforming one tree into the other by a sequence of edit operations. Among the best known tree edit distance algorithms, the majority can be categorized in terms of a framework named cover strateg...
متن کامل